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This study determines whether. an abbreviated test 
administered in September, could predict achievement rati 
d of kindergarten in June of a group of kindergarten / 
in an elementary school. An additional purpose was to^^- 
a prevention program on a random sample of predicted 
children in this group and to evaluate the test outcomes 
he possible ameliorative effects of treatment. The sample 

of 28 black students and 10 U white students who entered 
ten in September 1974. The results provide additional 
or the predictive validity of this abbreviated scfe,ening 
The major reservation concerning the results of this'' stud 
ntative", if not premature, state of the achievement 
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An abb/viated behavioral- screening battery has been developed 
which is d/signed to detect those children... (white luales) during kinder- 
garten /o in four years wil 1 become si.v..erely. d.isabled_^ 

reader's (Satz & Friel , 1973; Satz & Friel 1974; Satz. Friel & Rudegeair, 
r97;si Standardization of the test battery was based on the total popu- 
■Ution of white boys- (N=497) who started kindergartens the fall of 1970 : 
"in Alachua "county; Florida. Twenty elementary schools (14 urban, 6 rural ) 
comprised this population. The pred/ctive tests . (N=8), given during early . 
kindergarten, were later validated/against independent reading criteria 
at the end of Grades 1 (1972). 2/(1973) and 3 (1974). ^ Because of careful 
'tracking procedures, the follpwiup val idation studies were based on ap- 
proximately 95 percent of the. original population-. A separate cross vali- 
dation study was recently completed based on a sample of whUe boys (N=181) 
4om five of the urban 4:hools (Alachua County) who began kindergarten in 
• 1971 and whose reading scores were assessed three years later at the end of 
Grade 2 (1974). The children in this sample were predicted on the basis of 
their abbreviated test battery scores (kindergarten) and the discriminant 
- function weights derived from the original standardization population at 
the end of Grade 2. 

: The results Of the precedingjyal idation studies, based ^_discriminant_ 
function analyses, revealed that the tests given during kindergarten consis-. 
tently identified over 90 percent of the c-hildren destined to become^ severely 
disabled or superior readers in later years. Stepwise regressio°n analyses 
also revealed that a small number of tests consistently ranked highest in pre- 
dicting reading outcome's in later grades (1-3). These tests (Finger Locaii- 
zation. Alphabet Recitation. Recogni tion-Discrimination) loaded on a primary 
factor which was labeled a measure of sensori-percepf 5l-motor- 
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mniiiiionic ajjility. This' factor is 1 l-U to tap those skills which aro in 
primary ascendancy during preschool years (kindergarten) and which are 
" postulated to be crucial tb the early phases- of reading (Satz &"^Van Nos.- 
trand, 1972).^ A major review pf the theory guiding this research and 
the results are reported in a recent' chapter by Satz, Friel & Rudegeair 
(19''5). The results, in summary, lend support to the validity and utility 
of an early detection or 'yarning system' that could be administered eco- 
nOinically before the chi^d'' begi ns formal reading--at a time when his cen- 
tral nervous system may be more plastic and responsive to change-i-arid at a 
time when he is Tess; subject to the shattering effects of repeated academ- 
ic failure. 

Despite the promising nature of this early detection research, more 

cross'validation'work is needed to ensure that trul.y high risk children 

are identified /ipbr placement into early prevention programs. Fo,r example, 

IJ ' N . 

,if the tests, liave a high number of false positives, then serious risks of 

mislabeling could occur which would incorrectly place potentially normal 
readers" into early treatment programs!. Similarly, if the false negative 
rates .are high, then .unnecessary coslfs may result by instituting treatment 
p»=ograms which exclude the- majority of high risk children. . An additional " 
-need for further cross validation study is to determine whether the battery 
is applicable to girls (white & black) and to black boys. 

The purpose of the present study was to determine whether the abbrevi- 
ated test battery, given in September, could predict achievement ratings at 
the end of kindergarten (June) on a new group of kindergarten children 
(boys, girls, black, white) in an entire elementary school. An additional 
purpose was to institute a prevention program on a random sample of predic- 
ted high risk children in this group and to evaluate the test outcomes de- 
0 ■ . ' 
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spite the possible' amel iorative" effects of treatment. This design was 
felt to provide a more powerful test of the predictive efficiency of 
the abbreviated battery. 

Method 

Subjects 

The sample consistedof 28 black Ss (13 boys, 15 girls) and 104 
white Ss (54 boys, 50 girls) far a total of 132 Ss. This group repre- 
sented all those S_s who entered kindergarten in September, 1974 at 
Stephen Foster School (mean age=54.1 months). Additional age and SES 
information is presented in Table 1 for outcome achievement groups at 
the end of kindergarten. 
Predictor Te sts 

The abbreviated test battery consisted of seven tests and one 
non-test variable (SES)., The seven tests were: Finger localization. 
Alphabet Recitation, Recognition-Discrimination, Peabody IQ, Beery (VMl), 
Wepman Auditory Discrimination and Dichotic-Listening (Total R+L Score). 
A recent factor analysis of this battery, based on the original standardi- 
zation and cross validation groups (white male, N=578), revealed two main 
' factors using an oblique solution.^ Factor I, which comprised most of 
the verbal tests, included Peabody IQ, Alphabet Recitation, Auditory Dis- 
crimination and Dichotic-Listening. Factor II, which comprised most of 
the nonverbal perceptual tests, included Recognition-Discrimination, 
Beery (VMI) and Finger Localization. These factor, loadings are different 
than those reported for the standard 14 test batvery (Satz & Friel, 1973). 
Test Proc edure , . 

The tests were given during two weeks in September in office space 
provided by the eleinentjiry school. Intervention procedures were carried 
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out in a mobile laboratory parked or. school grounds. All testi/ng\^as 
adininistered individually by the re^-,oarch staff and took approximately 
"50 minutes per child. . At the conclusion of. the test admi nist/ration. 
all results were analysed via two computer program. (DSCRIMj and CLASIFY) 
which utilized the lambda weights^ d- -ived from the pooled s/tandardization 
and cross validation population of white boys (N=633). "Ba/sed on this 
analysis, 44 children were identified as severe high risk/(t) and 28 of 
'them were randomly assigned to two treatment groups for ihe duration of 
the school year.. The remainijig Ss were placed into a aon-treatment group 
(N=16).^ This random assignment yielded approximately/ equal numbers of 
.children (by race and sex) in each of the three groups. To prevent indi- 
vidual labeling of children, all test information was withheld from, the 
teachers during the school year. As a further cont/ol, selected children 
from the other predicted groups (low risk) were periodically brought to 
the trailer during the year for additional research study. 
Achievem_ejTt Critter ioj-Ratjj^ / 

The achievement ratings were obtained at the end of the school year 
(May-June) for each child. The ratings were mide by the individual class- 
room teachers who had taught the children during the school ^year.. An ovrr- 
alT achievement rather than reading criterioi was used because of difficulty 
in assessing reading competency at this dqej A ten-point interval scale 
was used to assign the children into one of four different ^achievement 
.. groups: Severe (0-4), Mild (5-6), AveraJ (7-9) and Superior (1 0) . Cri- ^ 
terion information was available for 128 bf the original children, or 97% of 
the sample: . Severe (N-12), Mild (N=33)/ Average (N=63) and Superior (N=20).^ 
~ This criterion evaluation, while admittedly tentative, if not premature, 
, h^s nevertheless been shown to hold uf/ in later years when more objective 



6 



reading measures are available (5.aL/, Friel & Rudegeair, 1 975).. 

R(.!Siii_Ls_ 

Cross Val i dati on y'' 

Ito]^ ind Standi ^^"^ '"^^"^ and standard deviations 

of the tests give^/in September arc presented i'ri Table 1 for each of the 
four outcome achievement groups (June). .Inspection of this table reveals 
an increasing/evel of performance across tests as one proceeds from the 
Severe to Su/erior groups.' For example, mean Peabody IQ ranged from 77.7 
in the Sej/re group to 113.6 in the Superior group. Similarly, perfor- 
mance on/the Beery (VMI) ranged from a mean score of 49.6 months in the 
'. Severe/group to 67.4 months in the Superior group". This means that the 
Seve/e group was lagging almost 13 months behind'their chronological atje ' 
W.5) whereas/^'he Superior group was advanced almost three and a half 
months beyond /their chronological age (64.1). 



/ Ipser't Table 1 about here 



Classification^ Subjects were then classified into the predicted 
..achievement, groups Based on the tests given in September plus the weights 
derived. from the original standardization and cross validation groups. 
The results, for the final outcome achievement groups" (.June), are presented 
in Table 2 for a 2x4 matrix. That is. the composite test predictions (Septem- 
. ber) are reduced to high risk (t & +) and low risk (I & signs and are 
represented by rows whereas the achievement outcomes (June) are represented 
by columns, "inspection of this table reveals that .the tests correctly pre- 
dicted lOOX of the Severe and Superior groups, while mi sclassifying 21% of - 
the Mild group (N= 7) and 41% of the Average group^ (h=26) . The-Qverall hit- 
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r.iLo was 9e)/128=74% for this.icross valj^tion sample. 

/ / 

Insert Table/7 about here ^ ^ 

' . ' • . - - - ' 

The classification outcomes for a 4x4 matrix are presented in -Table 
3. This table represents a more meaningful presentation of results because 
it reveals the outcomes for tne severe high ri:>k predictions (|) which for- 
med the decisional basis for the treatment programs (experimental and con- 
trol) in September (Grade K), The only difference in this table is that 
the composite. predictions are reduced to four levels (rows') which does not 
alter the overall hit-rate. . Inspection of this taDle reveals thatjnebc 
severe high risk indicators {%) Detected 100% of the children who at tne 
end- of kindergarten fell in the Severe group andSSX of those who fell in 
tne Mild group. Although it mi sclassif ied 13 children (20%) who fell in 
the Average group at the end of kindergarten, it did not misclassify any 
children who later fell in'the Superior group. \ 



Insert Table 3 about here 



In other words, there were only 13 misclassification errors using . 
the severe high risk signs, all of which were confined to the Average 
• group, NO children with these risk signs .^nded up in the Superior group.; 
- Moreover, when these children with .severe risk signs were examined for 
treatment group assignment (experimcntal\s . control)- it was fdund that 
eight of the 13 mi sclassif ied children were in treatment groups, which re- 
, duces the predictive error to only five children-again. none of whom 

,8 ' •■ • 

ended up in the Superior group! - 

erJc . ^ 



rate to 84.45i. 

sciission 

— The preceding results provide additional cross validational support 
for the predictive efficiency tf this abbreviated screening battery. The 
tests given ^t the beginning of Grade K, based on the weights derived from 
the standardization and cross val idation groups (white males), correctly 
predicted the achievement outcomes of a new sampleof children (boys, 

-girls, black, white) at the end of Grade K--particularly those destined 
to extremes in the achievement distribution. Furthermore, these predictions 
"'held up despite the fact that the sample varied, in terms of race and sex 
and the fact that a treatment program was 'instituted for a majority of the 
severe high risk children. In fact, it was shown that the validity of the 

. severe high risk test sign {t) was extremely high; it detected 100% of the 
children who fell -in the Severe group ^at the end of kindergarten^while mis- 
classifying only 20% of the children (N=13) who fell in the Average group 
(Table 3). However, this test sign (:j:=) misclassified no children who, at 
the end of kindergarten, fell in the Superior group. Moreover, of the 13 
high ris'k .children (20%) who fell in the Average group . at the end of kin- 
dergarten, eight of them were involved in individual treatment programs,^ 
throughout the entire year which suggests that treatment per se may have 
altered the high risk signs (false positives) seen at the beginning of 
kindergarten," These findings, in summary, reduce the risk, associated 
with false positive decision errors without increasing the false negative 
errors; i.e., 100% of the, Severe cases were correctly detected. 

The major reservation with the present results concerns the tenta- 
tive, if not premature state of the achievement criterion, The interval 
of nine, months between tests and criterion probes, 'particularly during 
kindergarten, provides only tentative information, at best. 
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on Lho validity of the current achievement ratings. Although predictions . 
of learning abil i ty -based upon teacher judgments have been shown to be 

• surprisingly accurate (Austin & Morrison. 1963; Kermoian, 1962; Feshbach, 
Adelman Fuller, 1974), the fear of mislabel ing may .increase the inci- 
dence of false negative errors, particularly for the severely high risk 
child'. This^ problem was seen recently in an unpublished study which com- 
pared teacher predictions (end of .Grade K) and test predictions (begin- 
ning Grade K") to reading outcomes at the end of Grade 2. The study was 
based on the third year, follow-up of the population of white boys who 

• began kindergarten in Alachua County, Florida (1970) and who represent 
the standardization group for the present abbreviated battery (Satz, 
Friel & Rudegeair, 1975). The results showed that while the overall ac- ' 
curacy of kindergarten teacher predictions was as high as the tests (ap- 

'prox-imately 80%) , the.detection of the severely high risk, child (Grade 2). 
was much lower when predicted by the teachers. The teachers identified 
only 19% of these children, whereas the tests detected 75% of them. 
other words, the overall teacher predictions were spuriously inflated by 
'good^oOtcome' forecasts When the base rates favored such outcomes (by 4:1). 
Howlier. whennhey predicted severe outcomes (which Was rare), their accuracy 
was extremely high (approximately 90%). 

This unpublished finding is relevant to the current "kindergarten ^ 
■ achievement criterion. It suggests that those children designated Severe 
may in fact. turn out to be so. but not those classified as Average, many of ^ 
whom in later years may fall in the.iMild to Severe achievement groups. If 
so, it would further lower "the false positive rate. of the abbreviated- test 
battery, especially when/predictions are foUowed-up in later years. This 
is indeed what happened when the test^predictions (Grade K) were evalua.ted 

er|c ' 10 



against achievement ^iteria at the end of Grade K (Satz & Friel, 1 973), 
o Grade 1 (Satz, Friel, 1974), Grade Z (Satz, Friel & Goebel , 1975) and 
Grade 3' (Satz, Friel & Rudegeair, 1975). In later years, the predictive ^ 
accuracy of the tests (Grade K) increased with incremental reductions in the 
' false positive rate--again, due presumably to increased validity in the 
criterion achievement; measures. 

This problem regarding preliminary criterion specification (at Grade 
K) may also explain, in part, the change in the discriminative ranking 
of the tests. It was' shown' that the four tests which ranked highest in 
the stepwise regression analysis were Peabody IQ, Alphabet Recitation, 
Dichotic Listening and SES (three of which loaded on the verbal factor). 
This discriminative ranking, however, contrasted with the triad of Finger, 
Localization, Alphabet Recitation, and Recognition-Discrimination which 
consistently ranked highest in later years (Grades 1-3) and which loaded 
an the sensor i -perceptual -motor factor. Although the change in discrimi- 
native ranking may in part reflect changes in the current school sample 
(i.e., girls & blacks), the change more likely reflects the nature of the- 
kindergarten criterion. The reason is that approximately the same dis- 
criminative ranking occurred in the standardization population at the end 
of Grade K (Satz & Friel, 1973) [ref. footnote 3]. 

The preceding explanations* of course, must ultimately rest on the 
follow-up evaluations at the end of Grades 1 and 2 with this cross valida- 
tion sample.^ If confirmed, they will provide additional support for the 
utility of an early warning system that could be administered economically 
before the child begins formal reading--at a^time when the central nervous 
system may be more plastic and responsive to change— and at a time when the 
" child is less subject to the shattering effects of repeated academic failure. 

ErJc 11 ^ 
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The ultimate task for education is to prevent the needless suffering sthat 
results when a systan fails tq develop a valid' early screening program for 
. its high risk children. 
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• ^-1 ins research was supported in part by funds from the NaLi<.n.i 
institutes of Health (NS08208) and the National Institutes of Mental 
Health (MH19415). 

■ ^The original screening battery consisted of 14 tests (Satz, Friel & 
Rudegeair, 1975). 

hhis discriminative ranking however, was not observed d^unng the 
■f/st follow-up evaluation or achiovcmont outcomes at the end or kinci.r- 
"Crten (Satz & Friel, 1973). The ranking in this preliminary study was as 
follows: Finger Localization. SES, Dichotic Listening (Total) and Peabody 

" Consult Satz, Friel & Rudegeair (1975) and Satz & Friel (1973) for ad 
ditional information concerning description'and selection of the test 
battery. . 

• ^The factor analysis was computed by Jack Fletcher. , , 

^written by D. 'o. Veldman, University of Texas% 1967; modified by 
X A. Goebel. Statistical analyses performed on IBM System/370-165. 
Northeast Re,gi,onal Data Center, Gainesville. Florida. 

, ^he treatment program and' rational e is not relevant to the present 
study but will be discussed • in a later study after follow-up evaluations 
are made ^in Grade 1 

8lt was also found that six of the 12 children 'in the Severe group 
and 11 of the 19 children in the Mild group were in treatment groups • 
throughout the kindergarten year. , 
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.'^Additional cross-cultural information on the predictive validity 
of this abbreviated battery will be available in the fall of. 1975. The ' 
battery W4S administered to a representative sample of white boys (N=450) 
who began elementary school in the Catholic System in Perth, Australia 
in February, 1975. This study will also investigate whether a further 
abbreviation of the test battery (N=o) will predict as well as the stan- 
dard abbreviation (N=8). Unpublished results have just shown that the 
modified abbreviation (excluding Dichotic Listening and Au.ditory Dis- 
crimination) predicts as well as the standard abbreviation. In fact, the 
modified abbreviation yielded a much lower, false positive rate when applied 
to the current cross val ida tion'sampl e. if these results can be repljcated 
on the Australian sample, they will substantially reduce the gost of 
administration for mass screening. 
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Means and Standard Deviations of Abbreviated Test Battery 
(Sept.. Kindergarten) for Achievement Groups (June. 
.;, Kindergarten) Cross Validation Sample II 



Criterion Achievement Groups 



Tests 



Finger 

Localization 

Alphabet 
Recitation 

Recognition- 
Discrimination 

Pea body IQ 
5. Beery (VMI) 



6. Auditory- ; 
Discrimination 

7. Dichotic 



Severe 
A ne=62.5 _ 



28.6 
(5.2) 

11.2 
(8.7) 

6.5 

(2.2) 

77.7 
(16.2) 

' 49.6. 
(9.1) 

1.0 
(0.6) 

58.7 



Listening (Total) (15.9) 



•8', Socio-economic 
Status 



I .4 
(0.5) 



(N=33) 
Aqe=60.8 



34.7 
(7:i) 

18.5 
.(6.8) 

7:5 
= (2.3), 

89.2 
(19.0). 

53.2 
(9.6) 

1.^3 
(0.4) 

70.4 
(12.2) 

.■ 1.7 
(0.5) 



Average 

~T^63T 
Aqe=63.0 

37.8 
(5.8) 

. 22.1 
(6.7) 

8.7 

■ (2:8) 

103.4 
" 05.3) , 

'61 .6 
(9.7) 

1.5 
(0.3) . 

73.4 
(14.0) 

1.9 
(0.3) 



Siiperi 0£ 
~(N=207 
_A3e^64_J_ 



40.5 
(3.9) 

25.3 
(2.3) 

10.2 
(2.8) 

113.6 
(7.5) 

67 ..4 
(10.5). 

1 .6 
(0.3) 

&D.9 
(6.3) 

2.0 
(0.0) 



a sample = all Kindergarten classes (Stephen Foster Schdol ) including girls 
and boys (white and black). - / 

i 

^ Age in months , / 

^ Score in months " / , ■ - " 
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• f • , ... 

Predictive Classification of Cross Validation Sample II (Sept-, 
Grade K) into Achievement Groups (June, Grade K) based on 
Discriminant Function Weights (Abbreviated .Battery) 
of Standardization Population (N=639)^' 

(2x4 Matrix) 









Criterion Achievement Groups 




Composite 
Oiscrimi nant 
Scores 








* 






Severe 


Mild 


Average 


Superior 




N 


* 

12 


26 


26 


0 . 


+ 


% 


(100) 


(80) 


■ (41) 


(0) 




N 


0 


7 


37 


20 




% ' , 


• (0) . 


(20) 


(59) 


(IOC)' 


' T 




° 12 

r 


33 


G3 


2& 



^.Population = Standardization Group (Grades K-2) and Cross Validation 
Group I (Grades K-2) . 

\ Total. Hits'= l5/12a:'= 74%. 



Table 3 



Predictive Classification „of Cross Validation Sample II (Sept., 
Grade K) into Achievement Groups (June, Grade K) based on 
Discriminant Function Weights (Abbreviated Battery) 
of Standardization Population (N=639). 

(4x4 Matrix) ■ 

^ • • 

Criterion Achievement Groups 



12 



Conipos'lite 
Discriminant 
Scores 


Severe 


Mild 


Average 


Superior 


T 


+ + 


- 12- 


.- ""^ 


. 13 


0 


44 


+ 


0 


7 


-13 


0 


:?.o 




' 0 


5 • 


^ . 27 


' 10 


42 




0 . 


2 


10 

-'J 


10 


' 22 



-33 



63 



20 



128 



^ Population = Standardiz/tion. Group (Grades K-2) and Cross.^Valida- 
tion Group I (Grades " , ^ . ^ 
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Table 4 



Predictive Classification of Cross Validation Sample II 
(Sept., Grade K) into Achievement^ Groups (June, Grade K) 

based on Discriminant Function Composite 

a ' *^ 

Scores (Sample 11) . - 

(2 x>4 Matrix) 



Composite 
Test 
Scores 



Criterion Achievement Groups 



Severe 



Mild 



Average / Superior 





• N 


• 

12 , 


25 ° 


1 2 


0 




% 


O.oo) 

■i 


(76.) 


(19) 


(0) 




N / 


0 


8 . , 


■ 51 


20 


0 


% 


.(0)- \ 


(24) 


(81) ■ 


■ (100) ' 


t 




12 


33 


63 


20 



Total Hits = 108/128 = 84% 



r 



Table 5 ■■ ' 

Discriminative Rankin/ and tumulative Hit Classification 
of Abbreviated /Test Battery based on Stepwise 
Discriir/inant Function' Analysis of 
Cros/S Validation Sample II . 



Ranked 
Variables, 



1. Peabody IQ/ 

2. Alphabet J ' ' 
Recitation 

3. Dichotia ; 
Listening (Total) 

■4. SES, / ■ 
5. Residual Tests 



Factor 



L-II 



Cumulative 
Hits {%) 



75.8 

'81 '3 

77.3. 

80.5 
84.4 



ERIC 



20 



